TRANSCODING METHOD AND APPARATUS THEREFOR 



BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention relates to a transcoding apparatus for converting a 
digital bitstream complying with a certain compression method into a digital 
bitstream complying with a different (or the same) compression method, and more 
particularly, to a transcoding method for converting a digital bitstream complying 
with a compression method performed using a quantizer without a dead zone into a 
digital bitstream complying with a compression method using a quantizer with the 
dead zone, and an apparatus therefor. 

2. Description of the Related Art 

The moving picture experts group (MPEG)-4 standard has been settled down 
as a compression method essential for the applications of recent Internet videos, 
mobile videos and smart media. Meanwhile, most existing digital video contents 
have been compressed according to the MPEG-1 standard, and the MPEG-2 
standard was selected as a fundamental video compressing method for digital 
television broadcast (or high definition television (HDTV)). Therefore, it is highly 
expected that a large number of digital video contents are compressed according to 
the MPEG-2 standard. Accordingly, to provide a lot of digital video contents to a 
variety of terminals relating to digital videos afterwards, it will be important to 
converting digital video contents encoded according to the existing MPEG-1 or 
MPEG-2 standard into digital video contents complying with the MPEG-4 standard. 

— SUMMARY OF THE INVENTION 

To solve the above problems, it is a first object of the present invention to 
provide a method of converting digital video contents compressed according to the 
moving picture experts group (MPEG)-1 standard or the MPEG-2 standard into 
digital video contents complying with the MPEG-4 standard. 
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It is a second object of the present invention to provide a method of 
converting a discrete cosine transform (DCT) coefficients quantized according to 
quantization without a dead zone into quantized DCT coefficients complying with 
quantization with the dead zone. 

It is a third object of the present invention to provide an apparatus suitable for 
the above methods. 

Accordingly, to achieve the first object of the invention, there is provided a 
transcoding method of performing conversion between compressed bitstreams 
having at least syntax elements and video elements corresponding to video data. 
The transcoding method includes the steps of a) decoding a first bitstream 
compressed according to a first compression method and parsing syntax elements 
and video elements; b) mapping the parsed syntax elements to syntax elements 
complying with a target second compression method; c) partially reconstructing 
video data complying with the first compression method from the parsed video 
elements; d) requantizing the video data reconstructed in the step c) according to 
the second compression method; and e) coding the mapped syntax elements and 
the requantized video data to obtain a bitstream complying with the second 
compression method. 

To achieve the second object of the invention, there is provided a 
requantizing method in which an output y with respect to an input DCT coefficient x 



is expressed by y = g ^ x y = 
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— + — 
A 2 



-A 



a quantization step size Aj is given by 



A/ = Wi ' Q / = o 1 2- • ■ 63 (Qp is a quantization parameter), a decision level t m is 

8 ' 



given by t = / m _ 1\ . A m > j , x m = {x\x e [t m t m+l ]} when x belongs to a section 

m v 2 



Vm, t m+ i\t an amplitude level A m of x m is expressed by x = 



x 1 

A 2 



, an output x' with 



respect to the input DCT coefficient^, which has been quantized by a MPEG-1 



quantizer having a dead zone in which a reconstruction level for x m1 that is, an 
inverse-quantized DCT coefficient r m is given by „ _h a Lis expressed by 
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x' = Q 2 (y) = \ 
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if O is odd 



- 1 if Q is even 



, a quantization step size A' is 



given by ^ _ 2Q , a decision level r '„ is given by^^^.^ ^ > i , 

j/„ = e[t' n j^ +1 ]} when the output >> belongs to a section [/;, and an 
amplitude level of y n , that is, an inverse-quantized DCT coefficient A f n is requantized 



by a MPEG-4 quantizer having a dead zone defined as - 



A' 



and is 



converted into a MPEG-4 DCT coefficient. The requantizing method includes the 
steps of d-1 ) defining subscript values allowing the decision level to belong to a 
section [t m1 t m+I ] as a set p _ e jy t ; d-2) defining candidates of the 

subscript values of the decision level as a set g = p\j{rnm{ P) - 1} where the 

symbol U indicates a union and an operator min{A} indicates a minimum value 
among the members of a set A; and d-3) selecting a member satisfying a cost 
function from among the candidate subscript values as a final subscript value, the 
cost function being expressed by 

min \ m x-p(x)dx 

k = wg ] Jf m ~K\ where C m = tm 



Jp(x)dx 



where C m is an optimum reconstruction level in the section [t m , t m+I ] used by a Lloyd- 
Max quantizer in view of mean square error, and p(x) is a Laplacian distribution 
function. 



To achieve the third object of the invention, there is provided a transcoding 
apparatus of performing conversion between compressed bitstreams having at least 
syntax elements and video elements corresponding to video data. The transcoding 
apparatus includes a decoder for reconstructing syntax elements and video 
elements from a first bitstream complying with a first compression method; an 
inverse quantizer for inverse-quantizing the video elements provided from the 
decoder according to the first compression method to reconstruct video data; a 
quantizer for requantizing the video data according to a second compression 
method; a syntax generator for mapping the syntax elements provided from the 
decoder to syntax elements complying with the second compression method; and an 
encoder for encoding the requantized video data (video elements complying with the 
second compression method) provided from the quantizer and the syntax elements 
provided from the syntax generator according to the second compression method, 
thereby outputting a second bitstream. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The above objectives and advantages of the present invention will become 
more apparent by describing in detail preferred embodiments thereof with reference 
to the attached drawings in which: 

FIG. 1 is a diagram illustrating the concept of a transcoding apparatus which 
is applied to moving picture experts group (MPEG)-1 compression; 

FIGS. 2 and 3 are diagrams illustrating conventional embodiments of the 
transcoding apparatus of FIG. 1 ; 

FIG. 4 is a diagram illustrating the processes of a transcoding method 
according to the present invention, through which a MPEG-1 bitstream is converted 
into a MPEG-4 bitstream; 

FIG. 5 is a flowchart illustrating a method of requatizing DCT coefficients 
according to the present invention; 

FIGS. 6A through 6E are diagrams for explaining the concept of quantization 
according to the present invention; 

FIGS. 7A and 7B are diagrams for explaining requantization according to the 
present invention; 



FIGS. 8A and 8B are graphs illustrating the effects of requantization 
according to the present invention; 

FIGS. 9A through 9C illustrate an original image, an image corresponding to 
bitstreams transcoded according to a conventional method and an image 
corresponding to bitstreams transcoded according to the present invention, 
respectively; 

FIGS. 10 through 12 are graphs illustrating the results of testing a 
transcoding method according to the present invention; and 

FIG. 13 is a block diagram illustrating a transcoding apparatus according to 
the present invention. 

DETAILED DESCRIPTION OF THE PRESENT INVENTION 

FIG. 1 illustrates the example of a transcoding apparatus applied to moving 
picture experts group (MPEG)-1 compression. The transcoding apparatus of FIG. 1 
converts an input MPEG-1 bitstream into a MPEG-1 bit stream having a different 
aspect ratio or a different bit rate. Here, the MPEG-1 bitstream indicates a digital 
video bitstream which has been compressed according to MPEG-1 compression. 

A transcoder 100 includes an internal MPEG-1 decoder 104 and an internal 
MPEG-1 encoder 106. The internal MPEG-1 decoder 104 decodes a MPEG-1 
bitstream provided from an external MPEG-1 encoder 102 to form video data. The 
internal MPEG-1 encoder 106 encodes the video data reconstructed by the internal 
MPEG-1 decoder 104 to form a MPEG-1 bitstream having a different aspect ratio or 
a different bit rate. An external MPEG-1 decoder 108 decodes the MPEG-1 
bitstream output from the internal MPEG-1 encoder 106 to form video data. Due to 
the operation of the transcoder 100, the aspect ratio and the bit rate of a MPEG-1 
bitstream input to the transcoder 100 are different from those of a MPEG-1 bitstream 
oatpottherefrom. 

The external MPEG-1 encoder 102 may be installed in a program provider 
party such as a video-on-demand (VoD) service provider, and the external MPEG-1 
decoder 108 may be installed in a receiving party provided with programs by the 
VoD service provider. In other words, the transcoder 100 converts bitstreams, 



provided by a program provider into bitstreams having a bit rate and an aspect ratio 
suitable for a display unit provided in a receiving party. 

FIGS. 2 and 3 are diagrams illustrating conventional embodiments of the 
transcoding apparatus of FIG. 1 . FIGS. 2 and 3 illustrates an open loop transcoder 
and a closed loop transcoder, respectively. Here, the open loop transcoder 
reconstruct only a portion, for example, a macroblock (MB), of video data from a 
bitstream. On the other hand, the closed loop transcoder reconstructs entire video 
data from a bitstream and encodes the video data. In MPEG compression, a MB is 
typically composed of 4 luminance blocks and 2 color difference blocks. 

Since input and output bitstreams are compressed by the same method or 
have the same syntax, the explanation of the operations of the apparatus of FIGS. 2 
and 3 is restricted to conversion of video elements, in particular, to conversion of 
MBs. Each MB includes a discrete cosine transform (DCT) coefficient encoded 
according to a variable length coding method, a motion vector and MB information. 

The open loop transcoder of FIG. 2 performs conversion in MB units. The 
motion vector and the MB information of each MB are separated from the MB before 
conversion and are united to the MB after the conversion. The DCT coefficient of 
the MB is decoded by a variable length decoder (VLD) 202 and then inverse- 
quantized by an inverse quantizer (IQ1) 204. The IQ1 204 recovers a DCT 
coefficient of each DCT block. A quantizer (Q2) 206 quantizes the DCT coefficient 
of each DCT block recovered by the IQ1 204 to be suitable for a set bit rate. A 
variable length coder (VLC) 208 variable length codes the DCT coefficient quantized 
by the Q2 206. A DCT coefficient output from the VLC 208 is united with the motion 
vector and the MB information which have been separated before conversion. 

The apparatus of FIG. 3 reconstructs entire video data from a MPEG-1 
bitstream and re-encodes the video data to have a desired aspect ratio and bit rate. 
A decoder 300 reconstructs video data from an input MPEG-1 bitstream. The 
reconstructed video data is re-encoded by an encoder 320. A bitstream 
corresponding to a desired aspect ratio and bit rate can be obtained due to the 
operation of the encoder 320. The decoder 300 is a typical MPEG-1 decoder and 
includes an inverse quantizer (IQ1) 302, an inverse DCT converter (IDCT1) 304, an 
adder 306, a memory (MEM1) 308 and a motion compensator (MC1) 310. The 



encoder 320 is a typical MPEG-1 encoder and includes a subtractor 322, a DCT 
converter (DCT2) 324, a quantizer (Q2) 326, an inverse quantizer (IQ2) 328, an 
inverse DCT converter (IDCT2) 330, a subtractor 332, a memory (MEM2) 334 and a 
motion compensator (MC2) 336. 

An open loop transcoder as shown in FIG. 2 demonstrates the excellent 
performance with a very small complexity when a video coding method is an intra- 
picture (l-picture) coding method. However, when a video coding method is an 
inter-picture (P-picture or B-picture) coding method, the open loop transcoder has 
problems in that a reference image changes due to a quantization error changing 
during motion compensation, resulting in the accumulation of errors (referred to as a 
drift). Consequently, a picture quality is degraded. 

A closed loop transcoder as shown in FIG. 3 is faithful to the concept 
illustrated in FIG. 1. An entire bitstream is decoded to reconstruct video data (by 
the decoder 300), and then the video data is re-encoded to generate a bitstream 
having a desired format (by the encoder 320). Such a closed loop transcoder has 
an advantage of stably generating a bitstreams of a desired bit rate, but has 
disadvantages of a large complexity of the apparatus and a large amount of 
calculation. 

In addition, the transcoders of FIGS. 2 and 3 perform a conversion between 
the same encoding methods so that they cannot be applied to MPEG-4 terminals 
which are essential to a variety of multimedia communications and storage relating 
to Internet videos, mobile networks, smart media and the like. 

To overcome these problems, the present invention provides an apparatus 
and method for converting a bitstream generated according to the MPEG-1 or 
MPEG-2 standard into a bitstream generated according to the MPEG-4 standard. In 
addition, the present invention provides an optimum requantization method for 
maintaining an image quality of much as possible during requantization of a DCT 
coefficient. 

Transcoding according to the present invention is performed as follows, 
a) An input MPEG-1 (or MPEG-2) bitstream is variable length decoded, and a 
MPEG-1 (or MPEG-2) syntax element and a DCT coefficient is parsed and extracted 



from the input bitstream. In this specification, description of this process is 
restricted to a video syntax element and DCT coefficient. 

b) The extracted MPEG-1 (or MPEG-2) syntax element is mapped to be 
suitable for the MPEG-4 standard. 

c) The extracted DCT coefficient is inverse-quantized according to the 
MPEG-1 (or MPEG-2) standard. 

d) An inverse-quantized DCT coefficient is requantized according to the 
MPEG-4 standard to meet a desired bit rate. A bit rate is set during this 
requantization. 

e) The syntax element mapped to comply with the MPEG-4 standard and the 
DCT coefficient requantized according to the MPEG-4 standard are variable length 
coded, thereby obtaining a bitstream complying with the MPEG-4 standard. 

FIG. 4 is a diagram illustrating the processes of a transcoding method 
according to the present invention, through which a MPEG-1 bitstream is converted 
into a MPEG-4 bitstream. In FIG. 4, the first row shows detailed processes for the 
parsing of MPEG-1 syntax elements and processes of inverse-quantizing a DCT 
coefficient. The second row shows processes of mapping the MPEG-1 syntax 
elements to MPEG-4 syntax elements. The third row shows processes of forming a 
MPEG-4 bitstream from the MPEG-4 syntax elements and a quantized DCT 
coefficient obtained through requantization. 

In step 402, a sequence header is parsed. In step 404, a group of picture 
(GOP) header is parsed. In step 406, a picture header is parsed. In step 408, a 
slice header is parsed. In step 410, a MB header is parsed. In the steps 402 
through 410, the sequence header, the GOP header, the picture header, the slice 
header and the MB header are sequentially separated from an input MPEG-1 
bitstream. In the MPEG-1 (MPEG-2) standard, an image has a layer structure 
composed of a DCT block, a MB, a slice, a picture and a GOP in the ascending 
order of size. In step 412 of converting an MB to a DCT, a DCT coefficient is 
reconstructed in MB unit. 

In an object layer syntax element mapping step 420, the syntax elements of a 
MPEG-1 sequence layer are mapped to syntax elements vop_width, vopjieight, 
bitjrate, fixed_yop_rate and aspectjratiojnfo of a MPEG-4 sequence layer based 
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on information contained in the parsed MPEG-1 sequence header obtained by 
performing the step 402. In an object plane syntax element mapping step 422, the 
syntax elements of a MPEG-1 object plane layer are mapped to syntax elements 
vop_coding_type, vop_fcode and vop_quant of a MPEG-4 object plane layer based 
on information contained in the parsed MPEG-1 picture header obtained by 
performing the step 406. In a slice layer element mapping step 424, the syntax 
element of a MPEG-1 slice layer is mapped to a syntax element slice_quant of a 
MPEG-4 slice layer based on information contained in the parsed MPEG-1 slice 
header obtained by performing the step 408. In a MB layer element mapping step 
426, the syntax elements of a MPEG-1 MB layer are mapped to syntax elements 
COD, DQUANT, CBPC, CBPY and MV of a MPEG-4 MB layer based on information 
contained in the parsed MB header obtained by performing the step 410. In a 
requantization step 428, the DCT coefficient of the MPEG-1 standard, which is 
obtained by performing the step 412, is requantized according to the MPEG-4 
standard. 

In an object header generating step 430, an object header including the 
MPEG-4 object layer syntax elements vop_width, vopjieight, bitjrate, 
fixed_vopjrate and aspect_ratio_info obtained through the step 420 is generated. 
In an object plane header generating step 432, an object plane header including the 
MPEG-4 object plane syntax elements vop_coding_type, vop_fcode and vop_quant 
obtained through the step 422 is generated. In a MB header generating step 434, a 
MB header including the MPEG-4 MB layer syntax elements COD, DQUANT, CBPC, 
CBPY and MV obtained through the step 426 is generated. In an encoding step 
436, the requantized DCT coefficient obtained through the step 428, the object 
header generated in the step 430, the object plane header generated in the step 
432 and the MB header generated in the step 434 are united and variable length 
cortedrthereby generating a MPEG-4 bitstream. 

In the present invention, a procedure of mapping MPEG-1 (or MPEG-2) 
syntax elements to MPEG-4 syntax elements is as follows. 

b-1) Since a syntax element f_code (indicating the motion range of a motion 
vector) in MPEG-1 has a different meaning from that in MPEG-4, the MPEG-1 



f_code is converted into a MPEG-4 f_code. In Table 1 , the motion ranges indicated 
by motion vectors and codes in MPEG-1 and MPEG-4 are compared. 

Table 1 



MPEG-4 


MPEG-1 
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[-32, 31] 
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[-16, 15] 
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2 


[-32, 31] 
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[-128, 127] 


3 


[-64, 63] 
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[-128, 127] 
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[-512, 511] 
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[-256, 255] 
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[-1024, 1023] 
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[-512, 511] 
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[-2048, 2047] 


7 


[-1024, 1023] 



As shown in Table 1 , in the same motion range, the value of the MPEG-4 
vop_f_code__forward is smaller than the value of the MPEG-1 forward_f_code by 1 . 
In addition, a motion range corresponding to the minimum value 1 of the MPEG-4 
vop_f_code_forward covers a motion range corresponding to the minimum value 1 
of the MPEG-1 forward_f_code. This can be expressed by Equation (1). 

vop f code forward 

= max((forward_f_code-1) I 1) 

where max(a, b) is an operator of selecting a larger value between a and b. 
b-2) A MPEG-1 MB type is converted into a MPEG-4 MB type. 



Table 2 compares the MB types between the MPEG-1 standard and the 
MPEG-4 standard. 
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As shown in Table 2, there are 6 MB types except "stuffing" and "inter+4v" in MPEG- 
4, and there are 8 MB types in MPEG-1. 

The three MB types "nomc+coded", M nomc+coded+q" and M mc+not coded" 
among the MB types in MPEG-1 do not exist in MPEG-4. To realize these 
properties, conversion is performed according to the following rules. 

(i) The MB type "nomc+coded" is set as the "inter" type of MPEG-4, and then 
a motion vector is set to (0, 0). 

^^^(n) The MB type "nomc+coded+q" is set as the M inter+q" type of MPEG-4, and 
then a motion vector is set to (0 7 0). 

(iii) The MB type "mc+not coded" is set as the "inter" type of MPEG-4. A 
motion vector is used as it is. The values of "cbpy" and "cbpc" are set to zero. 

(iv) For skipped MBs, the value of "code" determining "not coded" in MPEG-4 
is set to 0 such as "cod=0" as many times as the skipped MBs. 

b-3) A MPEG-1 CBP is converted into MPEG-4 CBP. 
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Two kinds of information cbpy (coded block pattern of Y where Y is a 
luminance signal) and cbpc (coded block pattern of C where C is a color signal) in 
MPEG-4 are united in one syntax element cbp in MPEG-1. The syntax element cbp 
in MPEG-1 indicates the existence/non-existence of a DCT block including a non- 
zero DCT coefficient for the DCT blocks included in a MB, and has as many bits as 
the DCT blocks included in the MB. Each bit indicates whether there is any non- 
zero DCT coefficient in a corresponding DCT block. Typically, a MB is composed of 
4 luminance DCT blocks and 2 color difference DCT blocks so that the syntax 
element cbp is represented with 6 bits. 

The syntax element cbpy in MPEG-4 corresponds to bits (4 bits) 
corresponding to the luminance DCT blocks in the MPEG-1 cbp, and the syntax 
element cbpc in MPEG-4 corresponds to bits (2 bits) corresponding to the color 
difference DCT blocks in the MPEG-1 cbp. 

The syntax element cbpy is individually coded, and this can be expressed by 
Equation (2). 

cbpy = (cbp&0x3c) » 2 ... (2) 

where "&" indicates an AND operation performed in bit unit, "0x3c" indicates "3c" of 
a hexadecimal number, and "»n" indicates an n-bit right shift operation. 
The syntax element cbpc can be expressed by Equation (3). 

cbpc = (cbp&0x03) » 2 ... (3) 

The syntax element cbpc is united with the MB type obtained in the above step (b-2) 
and coded to comply with the mcbpc VLC table of corresponding MPEG-4 l-VOP 
and P-VOP. 

b-4) The value of a MPEG-1 MQUANT is converted into a value of a MPEG-4 
DQUANT. 

The value of the MPEG-4 DQUANT is the difference between adjacent MBs 
in a quantization parameter (QUANT). QUANT values exist in the range of [1 , 31] 
and should satisfy a relation, \Q n - Q n _ x \< 2 . Converting a MPEG-1 MQUANT value 

into a MPEG-4 DQUANT value can be expressed by Equation (4). 
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dquant 

= min (max((mquant of current MB-mquant of previous MB), -2), 2) 



where "mquant" indicates a MPEG-1 quantization parameter. 

When fully reflecting the range of a current mquant to ensure accuracy much 
more, a resynchronization maker according to the MPEG-4 standard can be used. 

In MPEG-1 and MPEG-4, a quantization step size varies with a quantization 
weight as well as a quantization parameter. Usually, a MPEG-1 quantization step 
size A is always larger than a MPEG-4 quantization step size A'. Accordingly, when 
requantizing a DCT coefficient reconstructed from a MPEG-1 bitstream according to 
the MPEG-4 standard, disagreement between MPEG-1 and MPEG-4 in a 
quantization size should be appropriately handled to maximize the quality of an 
image reconstructed from a converted MPEG-4 bitstream. 

FIG. 5 is a flowchart illustrating a method of requatizing DCT coefficients 
according to the present invention. A procedure of requantizing a DCT coefficient in 
a transcoding method according to the present invention will be described with 
reference to FIG. 5. In step 502, a DCT coefficient is reconstructed from a MPEG-1 
bitstream in MB unit. In step 504, the Laplacian distribution of the DCT coefficient is 
estimated. In step 506, a reconstruction level is determined using the estimated 
Laplacian distribution characteristics of the DCT coefficient. In step 508, 
quantization is performed according to MPEG-4 using the determined reconstruction 
level. 

FIGS. 6A through 6E are diagrams for explaining the concept of quantization 
according to the present invention. FIG. 6A illustrates a MPEG-1 quantizer. FIG. 
6B illustrates quantization weighting factors Wi of a fundamental quantization table 
foraltrrninance component used in quantizing an intra MB. Based on the 
quantization weighting factors of the quantization table, a quantization step size is 
determined as follows. 

Wi-Q 
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where Q p is a quantization parameter. 

In FIG. 6A, the relation between the input x and output^ of the quantizer with 
respect to an arbitrary DCT coefficient can be expressed as follows. 



X 1 

— + — 
A 2 



...(6) 



where an operator |_aj indicates an integer most approximate to "a". 
A decision level t m is given by Equation (7). 



^= <>--)' A, m>\ 



...(7) 



When x belongs to a range [t mt t m+1 ], x m = {x\x e[t m t m+l ]} . An amplitude level A m of 
x m is given by Equation (8). 




From Equations (8) and (6), x m is mapped to an m-th reconstruction level 
according to Equation (9). 



An input x having a negative value has the following quantization mapping 
relation. 



FIG. 6C illustrates the quantizer characteristics for a MPEG-1 intra MB. In 
FIG. 6C, a unit on the vertical and horizontal axes "x" and "y" is a quantization step 
size, A. Here, a MPEG-1 quantizer is a uniform quantizer and does not have a dead 
zone. In other words, the length of a section including the origin is the same as that 
of any other section on the vertical axis. 

FIG. 6D illustrates a MPEG-4 quantizer. The output x' of the MPEG-4 
quantizer with respect to an input y is given by Equation (11). 



'«=LV A J 



...(9) 



...(10) 
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A' 



y 



A' 



y 




- 1 if Q p is even 



if Q is odd 



-(11) 



A quantization step size A' is given by Equation (12). 



...(12) 



where 0^ is a quantization parameter. 

A decision level t' n is given by Equation (13). 



t' n =n*A\ n>\ 



-(13) 



When the output y belongs to a range [*'„, y n = {y\y ^[t' n j' n+l ]} . An amplitude 
level A ' n of y n is given by Equation (14). 



The characteristics of the MPEG-4 quantizer are shown in FIG. 6E. In FIG. 
6E, the existence of a dead zone is confirmed. In other words, the length of a 
section including the origin is different from that of any other section on the vertical 
axis. 

In Equation (5), a MPEG-1 quantization step size A is weighted with a 
quantization table value Wi in addition to a quantization parameter Q r On the other 
hand, in the case of MPEG-4, the quantization parameter Q p is doubled according to 
Equation i (12), and the resultant value is set as a quantization step size A'. 

Accordingly, except the case where the quantization table value Wi is 16, a 
MPEG-1 quantization step size A is always larger than a MPEG-4 quantization step 
size A'. Referring to FIG. 6B, only when a quantization index / is 1 , 8 or 9, a MPEG- 
1 quantization step size is the same as a MPEG-4 quantization step size. Although 
a MPEG-1 quantization step size is the same as a MPEG-4 quantization step size 
when a quantization index / is 1 , 8 or 9, the dead zone of the quantizer Q1 is not the 



-(14) 



A' 
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same as that of the quantizer Q2, as shown in FIGS. 6C and 6E, so that it can be 
appreciated that the two quantizers have different characteristics. 

FIGS. 7A and 7B are diagrams for explaining requantization according to the 
present invention. FIG. 7 A illustrates a consecutive quantization structure in which 
the quantizers Q1 and Q2 of FIGS. 6A and 6D are combined in order to show 
requantization which should be considered for a transcoder. FIG. 7B illustrates the 
quantization characteristics of the quantizers Q1 and Q2 under a usual state where 
a MPEG-1 quantization step size A is larger than a MPEG-4 quantization step size 
A'. When an original DCT coefficient x is [t m1 t m+I ], y=r m in FIG. 7B. When this value 
is input to the quantizer Q2 f r '„ is reconstructed as the value of x ' Here, as shown 
in FIG. 7B, there are three candidate values r r' n and r / n+I of a reconstruction 
level, which can be taken by the quantizer Q2. In other words, distortion can be 
reduced by the quantizer Q2 selecting an optimum reconstruction level. 

As described above, due to the disagreement between the quantizers Q1 and 
Q2 in a dead zone and a quantization step size, the quantizer Q2 has a plurality of 
candidates of an amplitude level and needs to determine an optimum amplitude 
level among the candidates. An optimum amplitude level minimizes distortion and 
the number of generated bits. To determine such an optimum amplitude level, the 
present invention suggests an optimum amplitude level selection algorithm as 
follows. 

Subscript values allowing a decision level to belong to a section [t m , t m+1 ] in 
FIG. 7B are defined by a set P as shown below. 

P = {p\t' P *[t m j m+l ]} ...(15) 

The candidates of the subscript values of the decision level are defined by a 
set K as follows. 

AT-PU{min{P}-l} ...(16) 

where the symbol U indicates a union, and an operator min{A} indicates a minimum 
value among the members of a set A. 

Among the candidate subscript values, the members of the set K , a subscript 
value satisfying the following cost function is selected. 
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mm 



X- p(x)dx 



\C m -rA where C m = 



...(17) 



k <eK 




A amplitude level X f corresponding to the subscript k determined in 
accordance with the cost function expressed by Equation (17) is variable length 
coded. A balance point C m in the cost function of Equation (17) is an optimum 
reconstruction level in a section [t m1 t m+1 ] used by a Lloyd-Max quantizer in view of 
mean square error. Accordingly, the balance point C m can be calculated from the 
distribution p(x) of x. 

However, a transcoder cannot obtain the accurate statistical characteristic of 
the distribution p(x). To solve this problem, it is necessary to previously calculate 
the statistical characteristic of an original DCT coefficient when producing a MPEG- 
1 bitstream and transmit the information with the MPEG-1 bitstream to a transcoder. 
Alternatively, it is necessary to design a transcoder such that it can estimate the 
statistical characteristic of a distribution p(x). The former method is 
disadvantageous in increasing the number of overhead bits for additional 
information. The present invention uses the latter method. 

It is generally known that AC DCT coefficients comply with the following 
Laplacian distribution. 



Here, the statistical characteristic of a distribution p(x) is determined by the value of 
X. In each block, different values of X are assigned to 63 AC coefficients. 

When using Equation (18), an average of a random variable |x| can be given 
by Equation (19). 



Pit) 




...(18) 




From Equation (19), X can be expressed by the following equation. 



1 



...(20) 
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To verify Equation (20), k for a first frame of a flower garden sequence 
according to SIF standards was estimated using Equation (20), and the results are 
shown in Table 3. 

Table 3 





0 


1 


2 


3 


4 


5 


6 


7 


0 




0.019 


0.027 


0.040 


0.057 


0.076 


0.107 


0.170 


1 


0.018 


0.027 


0.035 


0.046 


0.063 


0.088 


0.133 


0.208 


2 


0.021 


0.029 


0.037 


0.049 


0.066 


0.092 


0.133 


0.220 


3 


0.025 


0.033 


0.040 


0.050 


0.066 


0.094 


0.148 


0.255 


4 


0.028 


0.034 


0.044 


0.056 


0.075 


0.107 


0.160 


0.267 


5 


0.032 


0.037 


0.046 


0.058 


0.078 


0.115 


0.166 


0.293 


6 


0.036 


0.041 


0.048 


0.062 


0.089 


0.123 


0.173 


0.304 


7 


0.043 


0.047 


0.058 


0.069 


0.093 


0.131 


0.188 


0.313 



FIGS. 8A and 8B are graphs illustrating the accuracy in estimating the value 
of A according to the present invention. To confirm the results shown in Table 3, a 
Laplacian distribution when A=0.046 is compared with an actual distribution for a 
coefficient (1, 3) in FIG. 8A, and a Laplacian distribution when A=0.166 is compared 
with an actual distribution for a coefficient (5, 6) in FIG. 8B. It can be seen that the 
value of X is smaller as an AC component becomes lower in frequency. This is 
because the probability that DCT coefficients having larger values exist at lower 
frequency is high due to the energy packing characteristics of DCT. 

Although the value of A can be calculated from Equation (19), the value of x 
cannot be obtained, but only the value of a reconstructed^ can be obtained in a 
transcoder. Accordingly, it is necessary to approximate the value of E(\x\) 
according to Equation (21). 

E^\) = E(\y\) + E(\z\), (21 

2 

The second term in Equation (21) is added to compensate for an average of the 
values of |x|, which are reconstructed to be zero after quantization because they 
belong to a dead zone, and is defined as Equation (22). 
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Zi 

E( \ z \\ =\\\ z [p( z ) dz 



...(22) 



The value oip{z) necessary for calculating the second term in Equation (21) 
is given by Equation (23). 



p(z) = — -e where X f — — ]— — 
2 E(\y\) 



...(23) 



J?(|z|) A in Equation (23) is calculated according to Equation (24). 



E(\z\) A = 2-J" 2 z- — •e ri2 dz = 



X' X' 2 



...(24) 
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Accordingly, the value of X estimated by a transcoder according to the 
present invention is expressed as follows. 



1 



1 



X' 



E(\x\) E(\y\)+E(\z\) 



2-e- A ' A/2 (l + -^') 
2 



...(25) 



If a dead zone does not exist, A=0 in Equation (25) so that X=X'. 

To verify the validity of Equation (25), a test was performed on a first flower 
garden image complying with the SIF standards. Table 4 shows the values of X for 
each AC coefficient calculated using Equation (25). It can be seen that the 
calculated values of X in the test are very similar to those shown in Table 3, with the 
exception that small errors occurs in values with respect to several high frequency 
components. However, these errors can be neglected. 

Table 4 





0 


1 


2 


3 


4 


5 


6 


7 


0 




0.018 


0.027 


0.038 


0.055 


0.070 


0.197 


0.169 


1 


0.016 


0.027 


0.035 


0.043 


0.069 


0.082 


0.121 


0.178 


2 


0.021 


0.029 


0.036 


0.046 


0.061 


0.083 


0.122 


0.191 


3 


0.025 


0.032 


0.038 


0.047 


0.060 


0.082 


0.153 


0.224 


4 


0.028 


0.033 


0.042 


0.051 


0.065 


0.097 


0.177 


0.250 
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5 


0.031 


0.035 


0.044 


0.053 


0.069 


0.105 


0.196 


0.309 


6 


0.035 


0.039 


0.045 


0.065 


0.076 


0.133 


0.237 


0.415 


7 


0.043 


0.044 


0.051 


0.061 


0.085 


0.177 


0.223 


0.450 



In the case where a MPEG-1 encoder employs a rate control scheme, a 
quantization parameter value is assigned an average of quantization parameters for 
an entire image to calculate A in Equation 6. It is necessary to estimate the value of 
X to use Equation (17), and a delay by one frame occurs when the value of X is 
calculated according to Equation (25). 

FIGS. 9A through 9C illustrate an original image, an image corresponding to 
bitstreams transcoded according to a conventional method and an image 
corresponding to bitstreams transcoded according to the present invention, 
respectively. 

FIGS. 10 through 12 are graphs illustrating the results of testing a 
transcoding method according to the present invention. In a first test, simulation 
was performed on 10 flower garden sequences complying with the SIF standards. 
All frames were encoded in a MPEG-1 intra mode, and the same quantization 
parameter was applied throughout an image. FIG. 10 shows an average peak 
signal-to-noise ratio (PSNR) of a sequence which was transcoded by a method 
proposed depending on changes in the value of Q r For comparison, an average 
PSNR of an input MPEG-1 sequence and an average PSNR of a sequence which 
was transcoded by a conventional method are shown together. PSNR gain is about 
0.3-0.6 dB. Table 5 shows average numbers of bits generated in the above three 
cases. 



Table 5 



Quantization 
parameters 


MPEG-1 coded 


Transcoded 
(conventional) 


Transcoded 
(proposed) 


Bit saving 


2 


297509 


304626 


293659 


10967 (3.60%) 


4 


228122 


241486 


229292 


10012 (4.14%) 


6 


187950 


204710 


191378 


13332 (6.51%) 


8 


159041 


177585 


165223 


12362 (6.96%) 


19 


138606 


158682 


149244 


9438 (5.94%) 


12 


123148 


144156 


135114 


9042 (6.27%) 



20 



14 


111746 


133572 


125123 


8149 (6.10%) 


16 


100669 


123430 


114320 


9110(7.38%) 


18 


92440 


116210 


106748 


9462 (8.14%) 


20 


85294 


109999 


101935 


8064 (7.33%) 


22 


29598 


105182 


97548 


7934 (7.52%) 


24 


74153 


100612 


93008 


7604 (7.55%) 



From Table 5, two interesting results can be inferred. One is that the number 
of bits generated in the proposed method is 3.6-7.5% smaller than that generated in 
the conventional method. This result can be explained based on the distribution of 
DCT coefficients. DCT coefficients are subject to a Laplacian distribution, in which 
a balance point C m having a positive value is on the left of the center between t m and 
t m+I . Since r m is at the center between t m and t m+}1 C m is usually smaller than r m . 
Accordingly, a probability that a smaller reconstruction level is selected increases. 
Since a small reconstruction level is used, the number of bits generated decreases. 

The second interesting result is that the number of bits after transcoding is 
always larger than the number of input MPEG-1 bits. This is because for the same 
quantization parameter, a MPEG-4 quantization step size is always smaller than a 
MPEG-1 quantization step size. Accordingly, MPEG-4 uses a more accurate 
quantization step size, and as a result, the number of bits generated increases. 

A second test was performed on table tennis test sequences under the same 
conditions as the first test. Like FIG. 1 0, average PSNRs are shown in FIG. 1 1 . 
Gain according to the proposed present method is about 0.2-0.4 dB. In Table 6, the 
numbers of bits in the three cases as described in FIG. 10 are compared. It can be 
seen that about 5-7% of the bits conventionally used is saved. 



Table 6 



CnraRTteation 
parameters 


MPEG-1 coded 


Transcoded 
(conventional) 


Transcoded 
(proposed) 


Bit saving 


4 


172314 


197318 


186303 


11015(5.58%) 


6 


124859 


152412 


143593 


8819 (5.78%) 


8 


98753 


125954 


118839 


7115 (5.64%) 


19 


80212 


105236 


97573 


7663 (7.28%) 


12 


67408 


90390 


84308 


6082 (6.72%) 



21 



14 


58334 


72344 


67354 


4990 (6.89%) 


16 


52039 


65004 


61114 


3850 (5.98%) 


18 


45919 


58261 


54391 


3870 (6.61%) 


20 


51684 


53324 


49381 


3943 (7.39%) 


22 


38170 


50124 


46381 


3743 (7.46%) 


24 


35613 


46312 


42785 


3527 (7.61%) 


26 


33328 


43981 


40503 


3478 (7.94%) 



A third test was performed on football sequences complying with the SIF 
standards. Each football sequence had a GOP whose size is 10 and was coded at 
a rate of 1.5 Mbps. FIG. 12 illustrates the PSNRs of 11th, 21st, 31st ... frames 
which are l-pictures. It can be seen from FIG. 12 that the proposed method has gain 
of about 0.3-0.5 dB as compared to the conventional method. The number of bits 
generated according to the conventional method was 195805, and the number of 
bits generated according to the proposed method was 183731. Accordingly, about 
6% of the bits was saved. 

FIG. 13 is a block diagram illustrating a transcoding apparatus according to 
the present invention. Referring to FIG. 13, a transcoding apparatus 1300 performs 
transcoding according to the transcoding method described with reference to FIGS. 
4 through 7B. Thus, the above contents described with reference to FIGS. 4 
through 7B will be referred to for describing the detailed operations of members. 

The transcoding apparatus 1300 includes a variable length decoder (VLD) 
1302, a MPEG-1 inverse quantizer 1304, a MPEG-4 quantizer 1306, a variable 
length coder (VLC) 1308 and a MPEG-4 syntax generator 1310. The VLD 1302 
reconstructs syntax elements and a quantized DCT coefficient from an input MPEG- 
1 bitstream and provides them to the MPEG-4 syntax generator 1310 and the 
MPEG-1 inverse quantizer 1304. The MPEG-1 inverse quantizer 1304 inverse- 
quantizes the quantized DCT coefficient according to MPEG-1 to reconstruct a DCT 
coefficient and provides the reconstructed DCT coefficient to the MPEG-4 quantizer 
1306. The MPEG-4 quantizer 1306 quantizes the reconstructed DCT coefficient 
according to MPEG-4 to obtain a quantized DCT coefficient and provides the 
quantized DCT coefficient to the VLC 1 308. By controlling a quantization step in the 
MPEG-4 quantizer 1306, a bit rate can be changed. 
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Meanwhile, the MPEG-4 syntax generator 1310 maps the syntax elements 
provided from the VLD 1302 to syntax elements complying with MPEG-4 and 
provides the MPEG-4 syntax elements to the VLC 1308. The mapping operation of 
the MPEG-4 syntax generator 1310 has been described in detail with reference to 
FIG. 4 and Tables 1 and 2. 

The VLC 1308 variable length codes the quantized DCT coefficient provided 
from the MPEG-4 quantizer 1306 and the syntax elements provided from the MPEG- 
4 syntax generator 1310 according to the MPEG-4 and outputs the result. A 
bitstream output from the VLC 1308 has been compressed according to the MPEG-4 
and has a desired aspect ratio and bit rate. 

A transcoding apparatus according to the present invention performs 
conversion between bitstream compressed according to different compression 
methods. In the above embodiments, a method and apparatus for converting a 
MPEG-1 bitstream into a MPEG-4 bitstream has been described, but it will be 
apparent that the present invention can be applied to the case of converting a 
MPEG-2 bitstream to a MPEG-4 bitstream and to conversion between bitstreams 
compressed according to the same method. In addition, the present invention can 
be applied to the case of converting a MPEG-4 bitstream to a MPEG-1 or MPEG-2 
bitstream. 

As described above, the present invention facilitates the conversion between 
bitstreams compressed according to different methods or the same method. 
Therefore, a transcoding method and apparatus according to the present invention 
allows existing digital video contents complying with MPEG-1 or MPEG-2 to be used 
in terminals designed to use MPEG-4 digital video contents such as Internet video, 
mobile video and smart media. 
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